Efficient mining of interesting emerging patterns and their effective use in classification

نویسنده

  • Hongjian Fan
چکیده

Knowledge Discovery in Databases (KDD), or Data Mining is used to discover interesting or useful patterns and relationships in data, with an emphasis on large volume of observational databases. Among many other types of information (knowledge) that can be discovered in data, patterns that are expressed in terms of features are popular because they can be understood and used directly by people. The recently proposed Emerging Pattern (EP) is one type of such knowledge patterns. Emerging Patterns are sets of items (conjunctions of attribute values) whose frequency changes significantly from one dataset to another. They are useful as a means of discovering distinctions inherently present amongst a collection of datasets and have been shown to be a powerful method for constructing accurate classifiers. In this doctoral dissertation, we study the following three major problems involved in the discovery of Emerging Patterns and the construction of classification systems based on Emerging Patterns: 1. How to efficiently discover the complete set of Emerging Patterns between two classes of data? 2. Which Emerging Patterns are interesting, namely, which Emerging Patterns are novel, useful and non-trivial? 3. Which Emerging Patterns are useful for classification purpose? And how to use these Emerging Patterns to build efficient and accurate classifiers?

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences

Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...

متن کامل

داده‌کاوی بالینی: مروری بر تکنیک‌های داده‌کاوی در دیابت

Background: Provide a health care service to the patients with diabetes provides useful information that could be used to identify, treatment, following up and prevention of diabetes. Explore and investigation of large volumes of data requires effective and efficient methods for finding hiding patterns in the data. The use of various techniques of data mining in particular Classification and Fr...

متن کامل

Analyzing and Investigating the Use of Electronic Payment Tools in Iran using Data Mining Techniques

In today's world, most financial transactions are carried out using done through electronic instruments and in the context of the Information Technology and Internet. Disregarding the application of new technologies at this field and sufficing to traditional ways, will result in financial loss and customer dissatisfaction. The aim of the present study is surveying and analyzing the use of elect...

متن کامل

Contrast pattern mining and its applications

The ability to distinguish, differentiate and contrastbetween different data sets is a key objective in datamining. Such ability can assist domain experts tounderstand their data, and can help in buildingclassification models. This presentation will introduce theprincipal techniques for contrasting data sets. It will alsofocus on some important real world application are...

متن کامل

Investigation of effective factors in expanding electronic payment in Iran using datamining techniques

E-banking has grown dramatically with the development of ICT industry and banks offer their services to customers from different channels. Nowadays, considering the great economic benefits of electronic banking systems, the need to pay attention to the expansion of electronic banking is increasingly felt in terms of reducing costs and increasing the bank's profitability. The purpose of this stu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005